Tolerating Faults in Counting Networks
Abstract
Counting networks were proposed by Aspnes, Herlihy and Shavit [4] as a technique for solving multiprocessor coordination problems. We describe a method for tolerating an arbitrary number of faults in counting networks. In our fault model, the following errors can occur dynamically in the counting network data structure: 1) a balancer's state is spuriously altered, 2) a balancer's state can no longer be accessed. We propose two approaches for tolerating faults. The first is based on a construction for a fault-tolerant balancer. We substitute a fault-tolerant balancer for every balancer in a counting network. Thus, we transform a counting network with depth O(log^2 n), where n is the width, into a k-fault-tolerant counting network with depth O(k log^2 n). The second approach is to append a correction network, built with fault-tolerant balancers, to a counting network that may experience faults. We present a bound on the error in the output token distribution of counting networks with faulty balancers (a generalization of the error bound for sorting networks with faulty comparators presented by Yao & Yao [21]). Given a token distribution with a bounded error, the correction network produces a token distribution that is smooth, i.e., the number of tokens on each output wire differs by at most one (a weaker condition than the step property). In order to tolerate k faults, the correction network has depth O(k^2 log n) for a network of width n.
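To make the fault model and the two output conditions concrete, here is a minimal Python sketch of a single two-output balancer. It is an illustration only, not the paper's fault-tolerant construction: the class `Balancer`, the fault-injection method `corrupt`, and the helpers `has_step_property`, `is_smooth` and `run` are all hypothetical names introduced here. It models the first fault type (a spuriously altered state) as a flip of the balancer's toggle bit.

```python
from typing import List, Optional


class Balancer:
    """Two-output toggle: successive tokens alternate between output 0 and 1."""

    def __init__(self) -> None:
        self.toggle = 0          # index of the output wire the next token takes
        self.counts = [0, 0]     # tokens that have left on each output wire

    def traverse(self) -> int:
        """Route one token and flip the toggle."""
        out = self.toggle
        self.counts[out] += 1
        self.toggle ^= 1
        return out

    def corrupt(self) -> None:
        """Hypothetical fault injection: the state is spuriously altered."""
        self.toggle ^= 1


def has_step_property(counts: List[int]) -> bool:
    """Step property: 0 <= counts[i] - counts[j] <= 1 for every i < j."""
    n = len(counts)
    return all(0 <= counts[i] - counts[j] <= 1
               for i in range(n) for j in range(i + 1, n))


def is_smooth(counts: List[int]) -> bool:
    """Smoothness: any two output counts differ by at most one."""
    return max(counts) - min(counts) <= 1


def run(num_tokens: int, fault_after: Optional[int] = None) -> List[int]:
    """Send tokens through one balancer; optionally flip its state
    once, after `fault_after` tokens have already passed (assumed model)."""
    b = Balancer()
    for t in range(num_tokens):
        if fault_after == t:
            b.corrupt()
        b.traverse()
    return b.counts


if __name__ == "__main__":
    for args in [(3, None), (3, 2), (2, 1)]:
        counts = run(*args)
        print(args, counts, has_step_property(counts), is_smooth(counts))
```

In this sketch a fault-free run satisfies the step property, a fault after two tokens leaves the counts smooth but breaks the step property, and a fault after one token breaks smoothness as well. The middle case shows why smoothness is the weaker condition.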
Related resources
Design of Two-Level Fault-Tolerant Networks
Some new techniques for the synthesis of fault-tolerant two-level combinational networks are presented. Two classes of faults are defined, 1) critical faults and 2) subcritical faults. Critical faults are the class of faults that cannot be tolerated by any two-level network. Necessary conditions for synthesis of networks tolerating subcritical faults are developed. As a result it is established ...
Fault-Diagnosis in a Multiple-Path Interconnection Network
Annotation: A two-pass routing scheme is described for communication in a multiprocessor system employing a unique-path multistage interconnection network in the presence of faults in the network. It is capable of tolerating all single faults and many multiple faults in all except the first and last stages of the network. The routing scheme is useful for tolerating both permanent as well as inter...
On the Fault Tolerance of Some Popular Bounded-Degree Networks
In this paper, we analyze the fault tolerance of several bounded-degree networks that are commonly used for parallel computation. Among other things, we show that an N-node butterfly network containing N^(1-ε) worst-case faults (for any constant ε > 0) can emulate a fault-free butterfly of the same size with only constant slowdown. The same result is proved for the shuffle-exchange network. Hence, these...
Tolerating Faults in a Mesh with a Row of Spare Nodes
Bruck, J., R. Cypher and C.-T. Ho, Tolerating faults in a mesh with a row of spare nodes, Theoretical Computer Science 128 (1994) 241-252. We present an efficient method for tolerating faults in a two-dimensional mesh architecture. Our approach is based on adding spare components (nodes) and extra links (edges) such that the resulting architecture can be reconfigured as a mesh in the presence of...